Picture for Qiang Liu

Qiang Liu

Linda

RC-GRPO: Reward-Conditioned Group Relative Policy Optimization for Multi-Turn Tool Calling Agents

Add code
Feb 03, 2026
Viaarxiv icon

Q Cache: Visual Attention is Valuable in Less than Half of Decode Layers for Multimodal Large Language Model

Add code
Feb 02, 2026
Viaarxiv icon

NAG: A Unified Native Architecture for Encoder-free Text-Graph Modeling in Language Models

Add code
Jan 30, 2026
Viaarxiv icon

ToolWeaver: Weaving Collaborative Semantics for Scalable Tool Use in Large Language Models

Add code
Jan 29, 2026
Viaarxiv icon

DiaDem: Advancing Dialogue Descriptions in Audiovisual Video Captioning for Multimodal Large Language Models

Add code
Jan 27, 2026
Viaarxiv icon

Evaluating and Achieving Controllable Code Completion in Code LLM

Add code
Jan 22, 2026
Viaarxiv icon

What Should I Cite? A RAG Benchmark for Academic Citation Prediction

Add code
Jan 21, 2026
Viaarxiv icon

From Completion to Editing: Unlocking Context-Aware Code Infilling via Search-and-Replace Instruction Tuning

Add code
Jan 19, 2026
Viaarxiv icon

Predict the Retrieval! Test time adaptation for Retrieval Augmented Generation

Add code
Jan 16, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon